The Distance Ladder 


There are two classes of distance measurements: those which 
produce distances that are independent of other methods, and 
those that are part of the distance ladder. Most of the measure- 
ments you are familiar with fall in the latter category. Below is a 
schematic of the distancer ladder, with blue representing meth- 
ods that work primarily (or exclusively) in Pop I systems, red for 
methods that work in old populations, and brown for geomet- 
ric techniques that lie outside the ladder. The PNLF techniques 
works in all (large) galaxies, and is in green. 
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The Base of the Distance Ladder 


At the base of the distance ladder are two geometric techniques: 
parallax (denoted by the symbol for parallax angle, 7), and the 
moving cluster method (which is most applicable to the Hyades). 
The former relies on the earth’s movement about the sun to pro- 
vide a surveyor’s baseline; the latter uses the space motions of 
co-moving groups of stars. (The angle between the cluster’s con- 
vergent point and its location gives you ratio of radial velocity 
to tranverse velocity; this ratio then translates into a distance 
via the proper motion.) Unfortunately, these techniques are only 
good for objects closer than a few hundred parsecs. Although a 
couple of hundred thousand stars are in range, most are normal 
solar-metallicity G,K, or M main sequence stars. In fact, there 
are very few “interesting” targets within reach of the geomtrical 
techniques. Thus, to measure distance, we must build a ladder. 





Distance Number within 


Type of Object (pc) Distance 
Open Clusters 200 5 
Metal-Poor Main Sequence 200 x 10 
Cepheid Variables 500 8 
RR Lyrae Variables 500 1 
Horizontal Branch Stars 600 eal 
Red Supergiants 500 0 
Blue Supergiants 500 1 
Planetary Nebulae 500 10 
Cataclysmic Varibles 350 29 


White Dwarfs 500 many! 
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Proper motion vectors for the Hyades stars from the classic paper 
by van Bureren in Bull. Astron. Inst. Netherlands XI (1952). 
Note that the vectors appear to converge at a point about 20 
degrees from the center of the cluster. 


Interstellar Extinction 


[Trumpler 1930, Lick Obs. Pub. 420] 

[Sandage 1973, Ap.J., 183, 711] 

[Burstein & Heiles 1982, A.J., 87, 1165] 

[Burstein & Heiles 1984, Ap.J. Suppl., 54, 33] 
[Schlegel, Finkbeiner, & Davis 1998, Ap.J., 500, 525] 


Any standard candle measurement must take interstellar extinc- 
tion into account. Extinction is present, even when the dust is 
not obvious. This was first proved by Trumpler in 1930, when 
he showed that the mean (linear) size for open clusters seemed 
to increase with distance. (In other words, the distances to these 
clusters were being overestimated, due to dust.) Since then, there 
have been a number of attempts to parameterize the dust distri- 
bution in the Galaxy. 











One approach by Sandage was to examine the BV R colors of 
bright galaxies as a function of galactic latitude. He then as- 
sumed that the Galactic dust is distributed in a fashion similar 
to that of a uniform disk of thickness 2@). In this case 
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Tate ae err? (17.01) 
If we change this into magnitudes, then 


—2.5log I = —2.5 log Ip + 2.5(log e)« £o esc b (17.02) 


If we let A = Klp(loge) = 1.086 k lo, then 
Am = Acscb (17.03) 
Sandage then modified this slightly 


Ap = 0.132(cscb— 1) for |b| < 50° 
=0 for |b| > 50° (17.04) 


Of course, by using a sample of bright galaxies, Sandage was 
already introducing a selection effect: the larger the extinction, 
the less likely the galaxy is to be in the sample 


Another approach which was taken by de Vaucouleurs, de Vau- 
couleurs & Corwin in the RC2 catalog, was to fit galaxy counts 
and radio data over the entire sky (using the underlying assump- 
tion that dust follows the gas). The result was a much more com- 
plicated function (too long to reproduce here). Today, most esti- 
mates of foreground Galactic extinction comes from two places. 





The first is an quantitative analysis of galaxy counts and H I 
emission over the entire sky. (Again, this assumes a constant 
dust-to-gas ratio.) These values are reproduced as a series of 
contour diagrams in Burstein & Heiles (1982) and tabulated for 
NGC and UGC galaxies in Burstein & Heiles (1984). The second 
is based on COBE measurements of thermal emission from the 
dust, and is described in Schlegel, Finkbeiner & Davis (1998). 
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FIG. 4(a}. A map of reddening in the galaxy for 10°<<65" and all galactic longitudes. These maps were generated by the H 1/GC method as 
described in the text. Regions of the map corresponding to declinations south of — 23° are less accurate than the rest of the map. The contour 
levels are labeled in units of 0.01 mag in E(B — V) in intervals of 0.03 mag. The contours start at 0.00, and the intervals corresponding to 0.09, 
0.24, 0.39, 0.54, and 0.69 mag of E (B — V ) are emphasized in heavy ink. Not all contours are labeled in this map owing to excessive crowding in 
parts. Closed contours with an underlined label indicate reddenings that are decreasing interior to the contour; otherwise contours indicate 


increasing reddening interior to the contour. Most geometric figures in this map are artifacts of the way the mapping program handles regions . 
without data. 


Wavelength Dependence of Extinction 


[Savage & Mathis 1979, A.R.A.A., 17, 73] 
[Krelowski & Papaj 1993, Pub. A.S.P., 105, 1209] 
[Bouchet et al. 1985, Astr. Ap., 149, 330] 
[Cardelli, Clayton, & Mathis 1989, Ap.J., 345, 245] 


The extinction due to dust varies with wavelength. In general, 
the shorter (bluer) the wavelength, the greater the extinction. 
Hence, the term “reddening”. ‘The precise wavelength depen- 
dence of extinction does vary from location to location in the 
Milky Way, and metal-poor galaxies appear to have different ex- 
tinction laws. However, in the optical, the logarithmic extinction 
is approximately A, « 1/X. Note that this is not Raleigh scat- 
tering, which goes as 1/A4. 
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Fig. 3. The SMC extinction curve from 2.2m to 130nm resulting from the 
present work (squares) and from Prévot et al, (1984) (triangles). The heavy line 
represents the galactic law [Savage and Mathis (1979), plus Koornneef (1983b) 
for the infrared] and the dotted line represents the LMC curve [mean of Nandy 
et al. (1981) and Koornneef and Code (1981) for the ultraviolet, and Koornneef 
(1982) for the infrared]. Encircled crosses at B and V wavelengths result from 
normalization 





There are three ways of quoting the amount of extinction to an 
object. The first is to quote the total extinction (in magnitudes), 
in some band, usually B (i.e., Ag). The second, which is used 
by people who study emission line objects, is to give c, the loga- 
rithmic amount of extinction at HG. But the most common way 
is to quote the differential extinction between two bandpasses, 
Dec 


E(B-V)=AB-AV 
= (Bobs Sj Bo) —_ (Vobs a= Vo) 
= (B-—V)o, — (B — V)o (17.05) 
Here, the subscript zero refers to the intrinsic magnitude of the 


object, and the subscript obs denotes the observed quantity. The 
extinction at any wavelength is then be calculated through 





A, =R E(B -V) (17.06) 


where R, is the ratio of the total to differential extinction. Some 
values of R are Rp = 4, Ry = 3, Rr = 2.3, and Rz = 1.5. 





The precise extinction properties of dust depends on the type 
of dust involved. For instance, dust can be silicon-rich (4.e., 
silicates), or carbon-rich. Also, observations in other galaxies 
suggest that the overall metallicity of the galaxy can affect dust 
properties. This is most pronouced in the ultraviolet, where low- 
metallicity galaxies have a rapidly increasing extinction curve. 
To account for this, Cardelli, Clayton & Mathis (1989) have pa- 
rameterized extinction via Ay, the ratio of total V extinction 
to differential extinction. Different values of Ay imply different 
extinction laws. 


The Extragalactic Calibrators — Pulsating Stars 


Pulsating stars, such as RR Lyr stars and Cepheids, are ex- 
tremely important for distance scale measurements. To under- 
stand why, let’s put aside the question of why stars pulsate (that’s 
a subject for a stellar structure class), and ask the question, if a 
star pulsates, what will be the period of its pulsation? 


Image a star that is unstable to pulsations. We’ll assume that 
about 1/2 of the star’s time is spent in the expansion phase, 
and the other half in the contraction phase. We’ll also make the 
assumption that the star’s change in radius is some percentage 
of the star’s average radius, i.e., dR x R. Now, suppose the star 
has just reached maximum. How fast can gravity restore the star 
to its normal state? In the case where the acceleration due to 
gravity is roughly constant, 





1 
R= sgt (17.07) 
The time, t, is a fraction of the star’s period of pulsation, so 
P œ t. Furthermore, the gravitational acceleration is simply 


GM/R?. So 
P? x (=) (17.08) 
M 


(Kelper’s law), or 
Bam (17.09) 





where (p) is the average density of the star. This simple equation 
is a remarkably good approximation for the period of all radially 
pulsating stars, and even some non-radial pulsators. 


You can also get at equation (17.09) in a different way. Con- 
sider that if a star’s interior undergoes a perturbation, that event 
will propagate through the star at the local sound speed, vs = 
\/yp/p, where p is the pressure. Therefore, the time it takes the 
star to adjust is 





1/2 
t=P x R/vs xR (£) (17.10) 


Now, if the star is in quasi-hydrostatic equilibrium 





dp GM 
E E 17.11 
dR a Re es) 


which, very roughly, integrates to 


p x Me (17.12) 


If you substitute this into (17.10), you again get 


R \ 1⁄2 R3\ 1/2 
P x Rol? (5) x< (Z) wla e (17.13) 


So whether you approach the problem via atmospheric free-fall, 
Kepler’s laws, or wave propagation, you get the same result. 


Let’s now put the equation for pulsation back in the form 


R? 
P? a 17.09 
x (5) (17.09) 
We know from the Stefan-Boltzmann law that 
L=AnR? oT, (17.14) 


so we can get rid of R in the equation. Thus, 


[3/4 


P x =~ 
3 
Tès p Mi? 


(17.15) 


or 
log P = 0.75 log £ — 3log Teff — 0.5log M + Q (17.16) 


where £ is the mean luminosity of the star, and Q is the constant 
of proportionality. Now, consider what this equation says: every 
star has a unique natural period for pulsation which depends on 
its luminosity (and temperature). 


Another way to look at the problem is to consider the HR di- 
agram, which is log £ vs. logTefp. In many respects, you can 
think of this diagram as the projection of a 3-dimensional di- 
agram, with axes of log £, logTeşfp. and M. However, there is 
nothing special about these variables; for instance, you can easily 
remove log £ (or log Teff) from the diagram, and replace it with 
R, through the Stefan-Boltzmann law. Or, equivalently, you can 
use (17.16) and replace any of the three variables with P. So, just 
as there are rules for the locations of stars in the HR diagram, 
there are rules for the positions of stars in log £-log T¢ ¢ f-log P 
space. 








RR Lyrae and Cepheids 


The two most commonly used standard candles among pulsating 
stars are RR Lyr variables and Cepheids. RR Lyr stars are old, 
primarily metal-poor objects that pulsate with periods 8 < P < 
20 hours, and amplitudes of S£ 1 mag. These stars began life on 
the main sequence with masses ~ 1Mo; they have subsequently 
evolved up the giant branch, lost some mass, ignited helium in 
the core, and settled on the horizontal branch, where they have 
a luminosity of ~ 50£6, and a mass of M ~ 0.6Mo. Because 
horizontal branch stars all have roughly the same luminosity, RR 
Lyr stars make excellent standard candles. (Note, however, that 
evolved horizontal branch stars should be more luminous than 
zero-age HB stars. Plus, there’s no guarantee that RR Lyr stars 
of different metallicity will have the same luminosity.) 
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The stellar evolutionary tracks of horizontal branch stars, and the 
RR Lyr instability strip. The tracks are different for other core 
masses and metallicities. As with the main-sequence, horizontal 
branch stars spend most of their life near their zero-age horizontal 
branch position. 


Cepheid variables are massive stars that have evolved up the gi- 
ant branch, ignited helium, decreased their luminosity (slightly), 
and entered into an intermediate phase before returning to the 
asymptotic giant branch. During this time, they make a loop in 
the HR diagram which passes through the instability strip. Be- 
cause Cepheids are Pop I objects, they are most often found in 
regions of active star formation, where the affects of interstellar 
dust are significant. Cepheid pulsation periods can be anywhere 
from a few days to a couple of months. 
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The evolutionary tracks of high-mass stars and the instability 
strip for Cepheid variables. 


The period of a Cepheid variable is a sensitive function of its ab- 
solute magnitude. To see this, consider that, like main-sequence 
stars, a mass-luminosity law exists for massive stars in the post- 
giant branch phase of evolution. So 


L x M| (17.17) 
where a ~ 4. When you combine that with the pulsation law 
P œ (p71 œx RAM (17.09) 


and the Stefan-Boltzmann law 


Lx R'Tipy (17.15) 
you get 
3a—2 n 
PL T (17.18) 
or (with a = 4), 
log L = 1.6 log P + 4.8 log Teff (17.19) 





Note that there is a strong temperature dependence, as well as 
a luminosity dependence. However, if you make the assumption 
that the instability strip is very narrow, then a direct relation 
can be found between log £ and log Teff, 





log L ~ 20 log Teff +C (17.20) 


The result is a simple log £-log P relation. 


Though elegant, the usefulness of the previous analysis is quite 
limited. While it does show that the calibration of Cepheids as 
standard candles requires two numbers, a slope and an intercept, 
it also assumes that log £ is a measureable quantity. It is not — 
what is observed is flux in a particular bandpass. 


Calibrating RR Lyr and Cepheid Variables 


In order to use RR Lyrae and Cepheids, the absolute luminosi- 
ties of these objects must be calibrated. In the case of RR Lyr 
stars, this means determining the star’s absolute magnitude (and 
perhaps how the absolute magnitude changes with metallicity). 
For Cepheids, one must determine both zero point and slope of 
the period-luminosity relation. 


Numerous methods are used to estimate the absolute magni- 
tudes of pulsating stars, including statistical parallax (whereby 
the Sun’s motion through space provides the baseline), main- 
sequence fitting of clusters which contain the stars, statistical 
parallax of globular cluster stars (where one observes the radial 
velocities of a set of stars in a globular cluster, and also measures 
their transverse proper motions), and pulsation theory. There is 
one especially notable technique: the Baade-Wesselink Method. 


The Baade-Wesselink Method 


Baade-Wesselink works for any pulsating star (or even exploding 
stars, such as supernovae). Consider a pusating star at minimum, 
with measured temperature, T}, and observed flux, fı. If the 
star’s radius at minimum is R,, then 


E 4r R?oT$ 


peno p (17.21) 


where D is the star’s distance. Later on at maximum, the star 
has observed flux, fo, a temperature To, and radius R2, so 


= An R20Ts 


h=- 722) 


The temperatures and fluxes are both observable, so there are 3 
unknowns (R1, R2, and D) and two equations. 


Now suppose you observe a star spectroscopically during its pulse 
from Rı to Rə. During this time, the star’s atmosphere has 
expanded at a velocity v(t) from Rı at time tı to Rə at time tə. 
So 
t2 
Ro = Ry + AR = Ry + J v(t) dt (17.23) 
ti 

If you measure v(t) throughout the pulse, you have a measure 
of AR. This gives you three equations and three unknowns, and 
let’s you solve for D. (This is left as an exercise to the student 
with nothing better to do.) 


In practice, the method is a bit more complicated. For example, 
how does one translate color to temperature? Can conversions 
which are derived from normal stars be applied to stars with 
dynamic atmospheres? In addition, how does the observed radial 








velocity of a stellar envelope relate to the true expansion velocity? 
The two are not equal. When you observe a star, you are getting 
flux over its entire surface, but only the very center of the star has 
a motion that is completely radial. The projection factor depends 
on a number of factors. When one moves from the center of a 
star to its limb, 

è Vops decreases as cos 0 

e the unit surface area increases as cos 0 

e limb darkening decreases the amount of flux received by the 
observer. 








From stellar atmospheres, the amount of limb darkening can be 
estimated via the Eddington approximation 


LO) = = cos + - (17.24) 


Putting this together yields 


L 
f J I(0) cos dw 


Vobs = 
ma (2 cos 6 + 2) cos? 6 sin 0d0 


jae (2 cos 6 + 2) cos 6 sin 6d0 


17 
= — |=141 17.2 
0) (=) v (17:25) 


[Gettings 1935, MNRAS, 95, 139] 


In practice, the calculation of the projection factor needs a model 
atmosphere analysis that involves the strength of the line being 
measured, the temperature of the star, and how one centroids 
the absorption line. 





The Slope of the Period-Luminosity Relation 


[Madore & Freedman 1991, Pub. A.S.P., 103, 933] 
[Freedman & Madore 1990, Ap.J., 365, 186] 
[Freedman et al. 2001, Ap.J., 553, 47] 


With work, the Baade-Wesselink method and other techniques 
can be used to obtain distances to individual stars. But in the 
Milky Way, there are only a handful (~ 10) Cepheids whose dis- 
tances (and therefore absolute magnitudes) with easily obtain- 
able distances. (Remember, Cepheids are Pop I objects, so they 
are usually located in star-forming regions in the plane of the 
galaxy, surrounded by patchy dust.) Thus, Galactic Cepheids 
are not ideal for determining the slope of the Period-Luminosity 
Relation. 





The best way to fix the slope is to observe a large sample of 
Cepheids at the same distance. Traditionally, this has been done 
in the Large Magellanic Cloud, our nearest (non dwarf) galaxy. 





Note the increased scatter in the P-L relation at bluer wave- 
lengths. Part of this is due to the larger effect of extinction. 
However, part of the scatter is intrinsic. The period of a Cepheid 
depends on both the star’s luminosity and its temperature, but 
because the instability strip is narrow, the latter variable is usu- 
ally neglected. However, consider the case of two stars on op- 
posite sides of the strip. When viewed in the optical, the small 
difference in temperature will propagate into a rather large dif- 
ference in flux within the bandpass (since you’re near the peak 
of the blackbody curve). This introduces scatter into the period- 
luminosity relation. On the other hand, if the two stars are 
viewed in the infrared, the flux within the bandpass doesn’t de- 
pend that much on temperature, hence the scatter is smaller. 








By observing Cepheids at multiple wavelengths, it is possible 
to remove all the effects of reddening. Consider a sample of 
Cepheids, extinguished by an amount of dust parameterized by 
E(B — V). The period-luminosity relation says that 


(Mz) = alog P +b (17.26) 


where x is the wavelength of the observation. When you observe 
Cepheids in another galaxy, you will observe their periods and 
their apparent magnitudes, (Mg). You therefore derive the ap- 
parent distance modulus, 4z, which is related to the true distance 
modulus, uo, by 








Hr = (Mz) = (Mz) = po + R, E(B => V) (17.27) 


In other words, you have one equation, but two unknowns, uo 
and E(B — V). However, if you observe at more than one wave- 
length, you have more than one equation (but still the same two 
unknowns). So you can solve for both the reddening and the 
distance simultaneously. 
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Fic, 4—The BRFRI apparent distance moduli for Cepheids in Baade's 
Fields 1, HI, and IY in M31 plotted against the inverse wavelength of the 


bandpass. The broken lines are Galactic extinction curves fitted to the appar- 
ent moduli by x? minimization. There is significantly different mean extinction 
from field to field, (with the 10 kpe Field III being most affected, and the 20 kpe 
Field IV being least affected); however, the extrapolation to zero extinction at 
infinite wavelength indicates remarkably little residual dispersion in true 
modulus. 





The Cepheid period luminosity relation probably depends a bit 
on the metallicity of the star. Unfortunately, not only is the 
amplitude of this effect controversial, but the is sign as well. 
The HST Distance Scale Key Project adopts a slight metallicity 
dependence, but probably shouldn’t have, since there is very little 
data supporting it. 








The SN 1987A Light Echo 


[Panagia, et al. 1991, Ap.J. (Letters), 380, L23] 
[Gould & Uza, O. 1998, Ap.J., 494, 118] 





Classically, the two most reliable standard candles are Cepheids 
and RR Lyrae stars. Yet the Cepheid distance modulus to the 
Large Magellanic is ~ 18.50, while the RR Lyrae star distance is 
~ 0.2 mag closer. Since the LMC is our nearest galaxy, this is 
disturbing. Fortunately, in 1987, a supernova went off near one 
of the star forming regions of the LMC. The observations of this 
supernovae have given us an independent distance to the galaxy. 





The facts of the case are these. In February 1987, SN 1987A 
erupted, sending gas expanding out at ~ 10,000 km s~'. The 
Hubble Space Telescope was not yet in orbit, but the super- 
novae was observed spectroscopically in the ultraviolet using the 
International Ultraviolet Explorer Telescope. Originally, IUE 
recorded only a bright continuum and very brought absorption 
lines from the supernova ejecta. But 83 + 6 days after outburst, 
the satellilte began to see a series of narrow emission lines. The 
narrow lines brightened continuously until day 413+ 24, at which 
time the emission plateaued. Since these lines were unresolved 
at IUE resolution, the emitting gas could not be ejecta from the 
supernova — it had to be from stationary surrounding material. 














Several years later, HST imaged the supernova. In addition to 
seeing the (still spatially unresolved) supernova remnant, HST 
detected a ring of emission surrounding the remnant. If you 
assume that this ring is circular, then the axis ratio of the ring 
(b/a = 0.73) implies an inclination of i = 43° + 2°. 


Now consider the following geometry. 











d 


Suppose the shell is circular, and that the emission lines seen by 
IUE came from the shell. The path difference between the light 
coming directly from the supernovae and the light echoing off the 
near side of the ring is 





Ap= R-—d=R- Rsini = R(1 — sini) (17.28) 
Thus the ring’s emission lines should have first appeared at time 
kebs ee (17.29) 

C 


The ring would then continue to brighten until the light from the 
farthest point of the ring reached us. 


R 
tə a to = 7 + sin i) (17.30) 


Using these two equations, the inclination of the ring would there- 
fore be 





(17.31) 


The physical size of the ring, R, then follows from (17.29), and 
the distance comes by comparing R to the observed size of the 
ring, 1766 + 003. 





When this is done, the derived distance to SN 1987A comes out 
to be 51.2 +3.1 kpc (or (m — M)o = 18.54 + 0.06a), in excellent 
agreement with the Cepheid distance. (This distance drops to 
50.1+3.1 kpc if one assumes that the supernova is on the far side 
of the galaxy.) Moreover, the inclination derived through (17.31) 
is 42° + 5°, almost exactly what is observed visually. This result 
presents extremely strong evidence that the Cepheid distance to 
the LMC is correct, and gets us to the nearest galaxy without 
the use of the distance ladder. Thus, the distance to the LMC 
is usually given as 50 kpc, or (m — M) = 18.50. This is what is 
used by the HST Distance Scale Key Project. 





Note however, that the above analysis assumes that the ring is 
perfectly circular the recombination associated with the ioniza- 
tion occurred instantaneously, and the delay times determined 
by IUE are accurate. A more complicated analysis Gould & 
Uza which uses a re-reduction of the JUE data, and takes this 
into account delays and non-circular geometry gives a distance 
modulus of (m — M)o < 18.37 + 0.04 for a circular ring, or 
(m— M) < 18.44+ 0.05 for a ring with an intrinsic axis ratio of 
b/a = 0.95. 


Secondary Standard Candles 


Standard candles can be divided into several types. First, there 
are primary distance candles, such as parallax, moving clus- 
ter, and Baade-Wesselink measurements. These require no as- 
sumptions about their calibration. Next, there are secondary 
standard candles, that require a calibration via some primary 
method. These secondary methods include main-sequence fit- 
ting, Cepheids and RR Lyrae variables; without parallax, etc., 
these would be useless. 





The techniques we will now talk about are, by and large, tertiary 
standard candles. Most of them are calibrated in nearby galaxies 
using Cepheids and/or RR Lyrae stars. Hence the uncertainties 
in the primary and secondary methods will propagate into these 
measurements. Moreover, the uncertainties in the testing of ter- 
tiary standard candles can also become an issue. 


There are two possible ways to test tertiary standard candles. 
The first, an external test, is compare the distances derived by 
one technique with those derived from another. If the two agree, 
then both methods are correct (or both are wrong in the same 
way). Alternatively, you can perform an internal test and derive 
distances to large numbers of galaxies at the same distance (say, 
a bunch of galaxies in a cluster). If no systematic error is seen 
(such as the blue galaxies being closer than the red galaxies), then 
you can feel somewhat confident that the method is producing 
good relative distances. 


Luminosity Functions 


It is very common in astronomy to study the number of objects 
present in a sample versus magnitude. Such a plot is called a 
luminosity function. There are several luminosity function stan- 
dard candles, including planetary nebulae, globular clusters, and 
red giant stars. In general, these methods do not go out far 
enough to be used to measure the undisturbed Hubble Flow, but 
they are useful for measuring distances to early-type galaxies 
(which have no Cepheids) or checking for systematic errors on 
the distance ladder. 





Luminosity functions are extremely useful in astronomy, not only 
for distance measurements, but for a whole host of other prob- 
lems, from the the distribution of Milky Way field stars to the 
evolution of galaxies in the universe. Thus, before addressing the 
individual techniques, we should first consider them in a general 
sense. 





There are two ways to think of a luminosity function. The con- 
ventional way is just to consider it as a function, N(m), where 
N is the number of objects within a finite bin size. (Actually, 
one almost always uses log N, rather than N.) Under this inter- 
pretation, one chooses the bin size, and creates the luminosity 
function by placing each object into its appropriate bin. Note, 
however, that due to observational errors, the observed luminos- 
ity function will not be the intrinsic function. 


To see this, consider an intrinsic power-law luminosity function, 
where each bin has 10 times the counts in the previous bin, t.e., 
bin k has 10 counts, bin k + 1 has 100 counts, bin k + 2 has 1000 
counts, etc. In the presence of observational errors, some of the 
counts from each bin will spill over into the adjacent bins, so bin 





k +1 will lose a fraction of its counts to bins k and k+2, but also 
gain counts from these bins. But note: all is not symmetrical. If 
bin k + 1 loses 5% of its counts to bin k + 2, but gains 5% of bin 
k + 2’s counts, then bin k + 1 is a big winner. We will therefore 
overestimate its numbers. The observed luminosity function is a 
convolution of the true function with the photometric error func- 
tion. In general, the observed function will always be smoother 
and flatter than the true function. 


The Eddington Correction 
[Eddington 1913, MNRAS, 73, 359] 


There’s a very tricky way to handle observational errors, which 
was first used by Eddington in 1913, and which is sometimes 
called the Eddington correction. (By far the best place to read 
about it is the classic 1953 book Statistical Astronomy, by Trum- 
pler and Weaver.) Let Nobs(Mmo) be the observed number of 
counts in a bin centered at mo, and N;(m) be the true lumi- 
nosity function. To save on notation, let £x = m — mo, and G(x) 
be the function which describes how the true number of counts 
in the bin centered at mg is distributed. For simplicity, let’s 
also assume that G(x) is symmetrical about zero, and, of course, 
normalizes to one, since no objects are lost. (Usually, you can 
consider G to be a Gaussian, with some dispersion, o). With this 
notation, Nobs is just the convolution of N; with G(s), 


Nas = Nit © G (18.01) 


and 
Nobs (Mo) = I Ni(mo — x) G(x) dx (18.02) 


Now, the key to solving this equation is to expand N; about mo 
using a Taylor series 


Nobs(™mo) = T f Nelo) = Neo), $ a +... i 





G(ax)dx (18.03) 


or 





Novs(mo) = Ni (mo) f G(x) dx — Netto) I x G(x) da+ 


— CO 


a“ i. e de Pend, (18.04) 


— OO 


Note that integrals of the form 


fin T x”G(x)dz (18.05) 


—Co 


give the moments of G. So (18.05) can be re-written as 


Nobvs(™mo) = Ne(mo) — N; (mo) T ST Ny (mo) —... (18.06) 


This is a perfectly good equation for Nobs (Mo), except that you 
don’t know N;(mg), or its derivatives. But it is possible to be 
tricky, and say that for some set of variables, A, the following 
must be true 


Ni (mo) = Novs(Mo) + A1 Nrs (Mo) + A> Nl, (mMo) Anaa (18.07) 


Now substitute for N;(mo) in (18.06) using (18.07), and use 
(18.05) to find expressions for N’,.(mo), N’.,(mo), etc. In or- 
der for Nops(mo) to equal Nobs(Mo), each term must equal its 


corresponding term. So 


Hy 
Ai— 7, =9 
H2 
Ag+ 5 =0 
M3 
As-  =0 
Avi As ee =G (18.08) 


2! A! 


This gives us the values of A. Finally, there is one last simplifi- 
cation. If G(x) is symmetrical, all the odd numbered terms go 
to zero. (You can see this easily if you graph what’s going on). 


So 


H2 H2Ņ\? pa iv 
Ni (mo) = Noss (mo) = SENS mo) + 4 (B2)? — Ht À Nig (mo) 


TE (18.09) 


And note: if G(x) is a Gaussian function, Un = 0". So, if we just 
keep the first term 


2 
o 
N, (mo) = Novs(mo) — 5r Noss (Mo) (18.10) 
Therefore, if you know your observational error, o, and have 
some idea of the shape of the luminosity function, you can easily 
correct your data and arrive at the true function. 


Luminosity Functions as Probability Distributions 


A second way of looking at a luminosity function is to consider it 
as a probability distribution. Let ¢(/) be the intrinsic luminos- 
ity function of a set of objects, convolved with the photometric 
errors. (You may not know what this luminosity function is, but 
that’s OK for now.) Since you’re treating the luminosity function 
as a probability distribution, 


[- ¢(M)dM =1 (18.11) 


Now let T be the total number of objects you are observing, and 
consider: how many objects are you likely to observe in a bin of 
size dM? Your expectation value is simply proportional to the 
probability amplitude at that bin, t.e., 


\(M)dM = T¢(M)dM (18.12) 


(So, if you’re near the peak of a Gaussian, you’ll observe more 
objects in a magnitude bin than you would on the tail.) Of 
course, the actual number of objects you will observe won’t be 
A(M)dM. Sometimes you’ll see more, sometimes less. Since the 
scatter should be Poissonian about the expectation value, the 
probability of observing n objects in a bin is 


p(n) = CLL eae (18.13) 


But that’s just for one bin. The total probability of finding nı 
objects in bin 1, nə objects in bin 2, etc. out to bin N is 


P=T[ritn) (18.14) 


This is a generalized likelihood: it tells you the probability of 
observing a given distribution of objects. 


Taken alone, (18.14) doesn’t seem too useful. After all, the prob- 
ability of finding any particular configuration of objects is rather 
small. However, one can then ask the question, “which is more 
likely, luminosity function ¢, or ¢2?” That is easily answered 
(i.e., , 1 is twice as likely as ¢2). Thus, (18.14) can be thought 
of as being an equation for relative likelihood. 


If one has thousands of objects, then (18.14) can be used directly 
to compute relatively likelihoods between many different models 
of (M). But the real power of the analysis occurs if the number 
of objects is small. To see this, let dM — 0. With a infinitessimal 
bin size, each bin will have either zero or one object in it. We 
can therefore re-write (18.14) as 


M)dM)° e —\(M)dM \(M)dM)' e7>M) 4M 
oe ny} Ee \ ia ) 3 


(18.15) 
where the first term represents all those bins with zero objects, 
and the second gives all the bins with one object. Now substitute 
using (18.12), factor out the exponential, 


P= I ene) 


and note that since dM — 0, the first term is really an integral, 
going from the brightest objects, down to the faintest object you 
can see, Mı. So 


P=exp{— f —~T¢(M am Trona (18.17) 


Mı 2 





JI {T¢(M;)dM},_, (18.16) 


all bins 


The first term in the equation is just a normalization; the second 
term is just the product of the individual probabilities. 


Let’s now apply this to a specific example: determining distance. 
We can translate absolute magnitude to apparent magnitude via 
M =m — u, so 


peop |- / = -rø(m)am | I Anessa: RAS) 


m1 — pb 


What is the most likely distance modulus, u? It’s the value that 
produces the highest relatively likelihood, P. 


Three additional notes. First, from a computaional point of view, 
it is usually easier to work in terms of log probabilities. In that 
case, the products in the above equations become summations, 
beers 
=e N 
apas Í -T¢(M)dM+NInT+) In o(m;—p) (18.19) 


IS i=i 


(where we’ve dropped the Nlndm term, since it’s just a con- 
stant). Second, if one were to investigate all possible distance 
moduli (and all possible values of T), then you can re-normalize 
the likelihoods to sum to one, since the solution must be in there 
somewhere. Finally, this type of analysis doesn’t tell you whether 
any of the fits are good or bad; it just let’s you know which are 
best. 


Some problems in astronomy are simple when you think of lumi- 
nosity functions as probability functions, and impossible if you 
work with them as luminosity functions. (For instance, how do 
you bin a total of 5 objects?) And the reverse is also true. So 
both tools are needed. 


Power-law Luminosity Functions 


Consider a luminosity function which is a power law, i.e., N(L) « 
L” yor 
N(m) =am+b (18.20) 


where a and b are two constants. In general, of course, these 
constants are not known ahead of time. (In fact, they are gen- 
erally what you are looking for.) Now, let’s observe two galaxies 
that look identical on the sky, but let one be 10 times further 
away than the other. Obviously, the objects in the more dis- 
tant galaxy will five magnitudes fainter than that of the nearer 
galaxy. But that more distant galaxy (in order to have the same 
apparent magnitude) would have to be 100 times more luminous. 
Therefore, the number of objects at each magnitude would be 100 
times greater. Consequently, the log N vs. m diagram for both 
galaxies would be the same! This is a general rule — you can’t 
tell anything from a power-law luminosity distribution. 
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The Globular Cluster Luminosity Function 


When put at the distance of Z 1 Mpc away, globular clusters look 
like a faint stars. But some galaxies have alot of them, mak- 
ing them easy to count, even with background contamination. 
(This is especially true in smooth, elliptical and SO systems.) In 
most galaxies, the GCLF is a log Gaussian, i.e., when plotted as 
log N vs. m, it is a Gaussian, 


log N(m) « eM Mo) 20° (18.21) 


with Mo ~ —7.3 in the V-band, and o ~ 1.2. Thus by observing 
the apparent magnitude of the GCLF peak, mo, it’s possible to 
estimate distance. 
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The observed globular cluster luminosity function for M87, and 
the best-fit Gaussian. 


Tip of the Red Giant Branch 


According to stellar evolution, when a star runs out of hydrogen 
in its core, it develops an inert helium core and evolves to the 
red in the HR diagram. Once on the giant branch, the stars 
increase their brightness, until helium itself fuses, via the triple- 
a process. Due to the physics of stellar interiors, this fusion 
takes place at about the same absolute luminosity for all stars 
with initial masses less than ~ 2.5Mo. Thus, the tip of the red 
giant branch (TRGB) can be a useful standard candle. 


Detecting individual red giants in distant galaxies is difficult; 
measuring their brightness is even harder. (Galaxies have many, 
many, many red giants, and their fluxes blur together.) In addi- 
tion, although the total luminosity of the red giant stars do not 
depend on their mass or metallicity, the flux in a particular filter 
does. (For example, metal-poor TRGB stars are more luminous 
in the J band than metal-rich TRGB stars.) Nevertheless, by 
finding the apparent magnitude of the RGB tip, and comparing 
it to the absolute magnitude of the TRGB in galactic globular 
clusters, distances can be estimated. 


The Planetary Nebula Luminosity Function 


[Jacoby 1989, Ap.J., 339, 39] 
[Ciardullo et al. 1989, Ap.J., 339, 53] 
[Ciardullo et al. 2002, Ap.J., 577, 31] 


When seen in other galaxies, planetary nebulae are point sources. 
The central stars of bright PN are stars that have just pealed off 
the asymptotic giant branch, and are moving across the HR di- 
agram (to the blue), and about to stop producing energy and 
become white dwarfs. In the meantime, however, they are ex- 
tremely bright (i.e., just as luminous as they were on the tip of 
the AGB) and extremely hot. Most of their energy is trapped 
by their surrounding nebulae, and goes into ionizing hydrogen. 
This energy is released in a series of emission lines, the brightest 
of which (usually by far) is [O IIT] 45007. 
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When you observe a PN in a distant galaxy, you will see light at 
5007 Å, but no light at adjacent wavelengths. (This is in contrast 
to stars which are continuum sources.) Therefore, they are easy 
to identify. 








The premise behind the planetary nebula luminosity function is 
that the luminosity function of PN in [O ITI] 45007 is always the 


same 
N(m) x e930 {1 ee A (18.22) 


where M* is the magnitude of the brightest possible PN. Since 
observations of PN are not done in any “standard” broadband 
system, the conversion between PN flux and PN magnitude is 
rather arbitrary (i.e., , was invented on the spare of the moment 
one Thursday morning in 1988) 


™m5007 = —2.5 log Fs007 — 13.74 (18.23) 


where F007 is the flux contained in the [O ITI] A5007 emission 


line in ergs cm~? s71. 


The PNLF technique has been tested rather thoroughly, both 
against other “reliable” techniques (such as Cepheid distances), 
and using multiple galaxies within a cluster. The results indicate 
the PNLF does not depend on the color, metallicity, or age of 
the galaxy, and that, PNLF measurements are good to ~ 10% 
or better. (The lack of dependence on metallicity is understand- 
able; since OTT is the major coolant in the nebula, decreasing its 
abundance only serves to drive up the electron temperature and 
increase the emission per ion. The invariance with population 
age is more of a mystery, and probably has something to do with 
blue straggler stars.) 








The PNLF is the only commonly used technique that works 
equally well in both spiral and elliptical galaxies. It is therefore 
extremely for cross-checking the results of different methods, and 
looking for systematic errors in the distance ladder. 


Surface Brightness Fluctuations 


[Tonry & Schneider 1988, A.J., 96, 807] 
[Tonry et al. 2001, Ap.J., 546, 681] 
[Jensen et al. 2003, Ap.J., 583, 712] 


Like the PNLF method, the surface brightness fluctuation (SBF) 
technique was also invented in 1988. Unlike the PNLF, it has the 
capability of reaching galaxies in the unperturbed Hubble flow. 


Consider the isophotes (contour of equal surface brightness) of a 
smooth elliptical or SO galaxy. Let n be the surface density of 
stars along the isophote, £ be the “mean” stellar luminosity, and 
r be the distance to the galaxy. The mean flux recorded in each 
pixel of your detector is, of course, independent of distance: as 
you move a galaxy further away, the flux from each star declines 
as 1/r?, but the number of stars per unit area on the sky increases 
as r°. So the mean flux (number of stars times flux from each 
star) is 

(F) x (nr?)(L£/r?) xnl (18.24) 
In reality, the number of stars in each pixel won’t be exactly the 
same — there will be some statistical fluctuation in the stellar 
density due to Poisson statistics. (In other words, if the average 
number of stars per pixel is 100, the expected pixel-to-pixel fluc- 
tuation will be 100, or 10%.) Consequently, the pixel-to-pixel 
flux will have a standard deviation of 


our) x (nr?)!/?(L/r?) (18.25) 
a variance of 
oC py x (nr?) (L/r?)? x nL Jr? (18.26) 
and a fractional variance of 
oF T aoa 
es ee nL (18.27) 


(Note that this, in fact, defines what “mean luminosity” means) 
n D N Li 
E ye Ni Li 


Thus, by measuring the fractional pixel-to-pixel “noise”, it is 
possible to estimate the distance, r. 


L (18.28) 


Of course, the method has its difficulties. Pixel-to-pixel noise 
comes from a variety of sources, including read-noise of the de- 
tector, shot-noise from photon statistics, noise from small knots 
of emission and/or dust, and noise from faint point sources (such 
as foreground stars, background galaxies, and the galaxy’s glob- 
ular clusters). So each of these must be modeled and removed. 


In addition, the method assumes that £ is the same for every 
galaxy. It is not. The value of £ (or equivalently M) is fainter 
in redder galaxies than it is in bluer galaxies. (It is also band- 
pass dependent!) The exact calibration of the SBF fluctuation 
magnitude has gone through many iterations (particularly in the 
I-band), and was given by Tonry et al. (2001) as 


M = —1.74 + 4.5 [(V — I)o — 1.15] (18.29) 


where 1.0 < (V — I)o < 1.3 is the intrinsic color of the galaxy 
and My is the surface brightness fluctuation as measured in the 
I-band. (Galaxies don’t get much redder than this, while bluer 
galaxies generally have dust and star formation which makes the 
measurement of surface brightness fluctuations impossible. So 
M; is calibrated only over a finite range.) 


In theory, surface brightness fluctuations can be measured in any 
filter bandpass (and it may be best to measure it in the infrared, 
where M is brightest), but the largest set of measurements have 
been done in J. 


The Results from Surface Brightness Fluctuations 


Over 300 galaxies have SBF distance measurements. Since many 
of these galaxies are distant enough to be in the unperturbed 
Hubble Flow, we should be able to obtain a Hubble Constant 
from its measurements. Unfortunately, doing so demonstrates 
the potential problems with Hubble Constant measurements. 


The SBF method is a tertiary standard candle, so it must be 
calibrated with another technique. ‘The most direct way to do 
this is through pulsating stars. RR Lyr variables aren’t suitable 
since they’re bright enough to be seen outside the Local Group. 
That leaves Cepheid variables. Unfortunately, Cepheids are Pop 
I objects, while SBF measurements can only be made in smooth 
galaxies without any dust (i.e., ellipticals and S0’s). Thus, in 
order to calibrate the SBF measurements with Cepheids, either 
use a) different galaxies in a group (and assume they’re all at the 
same distance, or b) spiral galaxies with sizeable bulges. 





There are 6 galaxies that satisfy the latter condition; if you cali- 
brate the SBF with these galaxies, you get Hp = 70+5 km/s/Mpc 
(with an additional zero point uncertainty of +6). This is the 
calibration that was used by the HST Key Project. However, a 
revised calibration based on infrared SBF measurements yield a 
value closer to Ho = 80 + 5 km/s/Mpc. 
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Cepheid Distance Modulus 


The Cepheid calibration of the SBF method. The plot shows the 
residuals between the mean calibration and the distance derived 
for each galaxy. 


However, SBF distances can also be calibrated indirectly via the 
PNLF (i.e., Cepheid — PNLF — SBF). Since planetary nebu- 
lae are found in all galaxies, there is substantial overlap in the 
techniques. To date, 13 galaxies have both PNLF and Cepheid 
measurements, and 28 galaxies have PNLF and SBF data. If 
you compute Ho in this way, then Hp is 15% larger. Much of 
this offset has now been explained — the zero point of the SBF 
method changed yet again. 
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The Cepheid calibration of the PNLF cutoff magnitude, M*. The 
galaxies’ Cepheid distances are used to translate the apparent 
magnitude of the cutoff, m*, to absolute magnitude. 
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The difference between the PNLF-derived distance modulus and 





SBF-derived distance modulus for 28 galaxies measured with 
both techniques. ‘The scatter is consistent with the expected 
errors (the curve), but the mean is quite different — 0.3 mag, or 


15%. 





